Nearest Neighbour Classification with Background Knowledge Extended to Semi-supervised Learning
نویسندگان
چکیده
Semi supervised methods involve converting unlabelled data into high quality labelled data that can be used to improve the performance of conventional supervised methods that had previously been given a small training set. Unlabelled data has also been shown to be helpful in a supervised setting called ‘bridging’ where unlabelled data have been used to help relate labelled instances to those that are being classified. In this paper, we propose a new supervised bridging method that can be used to improve existing semisupervised methods in certain problem settings.
منابع مشابه
Local nearest neighbour classification with applications to semi-supervised learning
We derive a new asymptotic expansion for the global excess risk of a local k-nearest neighbour classifier, where the choice of k may depend upon the test point. This expansion elucidates conditions under which the dominant contribution to the excess risk comes from the locus of points at which each class label is equally likely to occur, but we also show that if these conditions are not satisfi...
متن کاملEnhancing the Performance of Semi-Supervised Classification Algorithms with Bridging
Traditional supervised classification algorithms require a large number of labelled examples to perform accurately. Semi-supervised classification algorithms attempt to overcome this major limitation by also using unlabelled examples. Unlabelled examples have also been used to improve nearest neighbour text classification in a method called bridging. In this paper, we propose the use of bridgin...
متن کاملScene Classification Via pLSA
Given a set of images of scenes containing multiple object categories (e.g. grass, roads, buildings) our objective is to discover these objects in each image in an unsupervised manner, and to use this object distribution to perform scene classification. We achieve this discovery using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature, here ap...
متن کاملSemi-Supervised Self-Organizing Feature Map for Gene Classification
In this thesis, a study on gene expression data analysis is done using some supervised, unsupervised and semi-supervised approaches. The task of class prediction for six gene expression datasets (namely, Brain Tumor, Colon Cancer, Leukemia, Lymphoma and SRBCT) has been carried out. Here, a one-dimensional self-organizing feature maps (SOFM) in a semi-supervised learning framework is developed f...
متن کاملUnsupervised Feature Learning via Non-Parametric Instance Discrimination
Neural net classifiers trained on data with annotated class labels can also capture apparent visual similarity among categories without being directed to do so. We study whether this observation can be extended beyond the conventional domain of supervised learning: Can we learn a good feature representation that captures apparent similarity among instances, instead of classes, by merely asking ...
متن کامل